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Abstract 

In this paper we introduce a new method to add a parameter to a family of distri- 
butions. The additional parameter is completely studied and a full description of its 
behaviour in the distribution is given. We obtain several mathematical properties of 
the new class of distributions such as Kullback-Leibler divergence, Shannon entropy, 
moments, order statistics, estimation of the parameters and inference for large sample. 
Further, we showed that the new distribution have the reference distribution as special 
case, and that the usual inference procedures also hold in this case. Furthermore, we 
applied our method to yield three-parameter extensions of the Weibull and beta dis- 
tributions. To motivate the use of our class of distributions, we present a successful 
application to fatigue life data. 

Keywords: Exp-G distribution; Order statistics; Fisher's information matrix; Exp- 
Weibull distribution; Exp-beta distribution. 

1 Introduction 

The present work is an enhanced and extended version of the pioneering manuscript presented 
at Estancia de Sao Pedro, Sao Paulo, Brazil, in the 18° SINAPE, 2008, see Barreto-Souza 
et al. (2008). 

In many practical situations, usual probability distributions do not provide an adequate 
fit. By example, if the data are asymmetric, normal distribution will not be a good choice. 
With this, several methods of introducing a parameter to expand a family of distributions 
have been studied. 

Marshall and Olkin (1997) introduced a new way to expand probability distributions and 
applied to yield a two-parameter extension of the exponential distribution which can serve 
as a competitor to such commonly-used two-parameter distributions as the Weibull, gamma 
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and lognormal distributions. Furthermore, this method was used to obtain a three-parameter 
extension of the Weibull distribution. Moreover, Mudholkar et al. (1996) introduced a 
three-parameter distribution alternative to the Weibull distribution, that has the Weibull as 
limiting distribution. 

Some methods of introducing of parameters to symmetric distributions have been stud- 
ied in order to add skewness. For instance, Azzalini (1985) introduced and studied the 
well-known skew-normal distribution, which is obtained by adding a shape parameter to 
the normal distribution. Another symmetric distribution that was extended by adding a 
skewness parameter was the Student's-t distribution by Jones and Faddy (2003). Finally, 
Ma and Genton (2004) introduced a general class of skew-symmetric distributions, whereas 
Ferreira and Steel (2006) provides a general perspective on the introduction of skewness into 
symmetric distributions. 

Recently, Jones (2004) introduced a class of distributions that adds two parameters to a 
reference distribution. Further, Jones and Pewsey (2009) advanced a four-parameter family 
has both symmetric and skewed members and allows for tail weights that are both heavier 
and lighter than those of the generating distribution. 

In this article, we introduce a new method to add a parameter to some reference distribu- 
tion. The resulting distribution exhibits the remarkable reciprocal property. We study this 
parameter in detail, and we give a full description of its behaviour in the distribution. The 
augmented distribution has several connections with the reference distribution, for instance, 
the Kullback-Leibler divergence of the augmented distribution with respect to the original 
distribution is finite and only depends on the new parameter. Several others properties in 
this direction are also given. The inferential aspects of this distribution are studied in de- 
tails, and two special cases are discussed, and a successful empirical application shows the 
flexibility the new distribution, and also motivates its usage. 

A special attention must be given to the fact that it is not straightforward that this new 
distribution contains the reference distribution as special case. We show that this is the 
case if we enlarge the parameter space, and also, that this enlargement is good, in the sense 
that, all the standard inferential procedures work if this new value in the parameter space 
is considered to be the true value of the parameter. 

The remaining of the article unfolds as follows: in Section [2] the new class of distributions 
is introduced, several properties are given, the new parameter is completely characterized, 
and the inferential aspects are discussed. Sections [3] and @] deals with two special cases: the 
exp- Weibull and exp-beta distributions, respectively. In Section an empirical application 
shows the usefulness of this distribution. Finally, Section [6] ends the article with some 
concluding remarks. The Appendix contains the proofs of the results presented in the article. 
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2 The new class of distributions 

The cdf of a random variable with truncated exponential distribution in the interval [0, 1] 
with parameter A is given by 

1 _ P — As 

where A > and x G [0, 1]. We now observe that F£(-) is a cdf for A G R \ {0}, and that 

limF A *(x) = x, x G [0, 1]. 

A->0 



Therefore, we extend the parameter space of the distribution above for the entire line: 

Fx(x) = 



F A *(x), if A ^ 0, 
x, if A = 0. 

We now define the new class as follows. Let G(x; 9) be the cdf of a continuous or discrete 
random variable with 9 being the parameters related to G, then the class of distributions 
exp-G, indexed by A, is defined by 

Ff(x) = F x (G{x ] 0)). (2) 

From now on, we will denote a random variable X with cdf (j2j) by X ~ exp-G(O), where 
Q = (X,9) T . ' ' 

If G(x; 9) is a cdf of a continuous random variable, then the exp-G distribution is abso- 
lutely continuous for every A ^ 0, and its probability density function (pdf), which is the 
derivative of the cdf (jlj) with respect to x, is given by 

f(x) = f x (x) = — ^-<7(z; 0) exp{-AG(x; 0)}, (3) 
1 — e A 

where #(•; 6 1 ) is the pdf associated to the cdf G(-; 6). Let G(x; 6) be a cdf of a discrete random 
variable taking values on the set {x\,x 2 , ■ ■ .}, where x\ < x% < ■ ■ ■ , then the corresponding 
exp-G distribution is also discrete, takes values on the same set for every A ^ 0, and its 
probability function is given by 

expj-XGjxj-i, 6)}- exp{-AG(^; 0)} 

where G(xq) = 0. 

If G(x; 0) is a absolutely continuous cdf, then its hazard function is given by 

h(x; 6) ee fe A (a;; 0) = X f X]9 l (5) 

1 — exp{— \S(x; v)\ 

where S(x; 9) = 1 — G(x; 9) is the survival function of a random variable with cdf G(-, 9). 

We now state several results regarding the relation between the exp-G and G distribu- 
tions, where proofs can be found in the Appendix. 
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Proposition 2.1 Let X and X\ have G distribution and exp-G distribution with parameter 
X, respectively. Let also fi be the law of X, and fi\ be the law of X\. Then, 

i) X and X\ have the same support for all A 7^ 0; 

ii) if X is continuous, singular or discrete, then X\ is continuous, singular or discrete, 
respectively, for all X ^ 0; 

Hi) fi « fi\, that is, fi\ is absolutely continuous with respect to fi. Moreover, the Radon- 
Nikodym derivative of fix with respect to fi is, almost surely, 

dfi\, . 1 exp{— XG(x — e)} — exp{— XG(x)} 

= 13^=1 G(x)-G(x-e) ' 



iv) if X is continuous, the relative entropy (Kullback-Leibler divergence) between X and 
X\ is 

dfi\ X ( X 



D K l(v 1 1 fi\) = - j log -jj^dfi = 1 - eX \ l ~ log 
v) if E(\X\ r ) < 00, then E(\X\\ r ) < 00, and, moreover, if X > 



1-e 



and if X < 

-^— x E(\Xn < E(\X X 
1 — e A 



We now give a characterization for our class of distributions through Shannon entropy. 
Such entropy were introduced by Shannon (1948) and, for a random variable X with density 
/(•), with respect to a cr-finite measure fi, usually the Lebesgue or counting measure, is given 

by ' 

H 5 (/) = - / f (x) log f(x)dfi. (6) 

Jaynes (1957) introduced one of the most powerful techniques employed in the field of prob- 
ability and statistics called maximum entropy method. This method is closely related to the 
Shannon entropy and considers a class of density functions 

¥ = {f(x):E f {T t (X)} = a t , i = 0,...,m} (7) 

where Ti(X), i = l,...,m, are absolutely integrable functions with respect to fdfi, and 
Tq(X) = «o = 1- In the continuous case, the maximum entropy principle suggests to derive 
the unknown density function of the random variable X by the model that maximizes the 
Shannon entropy in ([6]), subject to the information constraints defined in the class F. 
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The maximum entropy distribution is the density of the class F, denoted by / , which 
is obtained as the solution of the optimization problem 

f ME = argmaxH s (f). 

Jaynes (1957), in the page 623, states that the maximum entropy distribution f ME , obtained 
by the constrained maximization problem described above, "is the only unbiased assignment 
we can make; to use any other would amount to arbitrary assumption of information which 
by hypothesis we do not have." It is the distribution which should not incorporate additional 
exterior information other than which is specified by the constraints. 

In order to obtain a maximum entropy characterization for our class of distributions, 
we now derive suitable constraints. For this, the next result plays a important role. We 
will assume in the Propositions 12.21 and 12.31 that the reference measure, /x, is the Lebesgue 
measure, and that all the random variables involved are continuous. 



Proposition 2.2 Let G be the distribution of a continuous random variable, with pdf, g(-), 
and let X be a random variable with pdf /(•) given Then, we have that 

CI) E{logg(X;9)} = E {log g(G-\U; 9))} = f log g(G-\u;9))e- Xu du, 

1 ~~ e Jo 

C2) E{G(X;9)} = ±-^- [ 
and the Shannon entropy of /(■) is given by 

Mf) = 1 " pAy " lo S (r^) ~ E ^9(G-\U; 9))}, (8) 
where U follows truncated exponential distribution with parameter X and cdf given by (T7J). 

The next proposition shows that the class exp-G of distributions has maximum entropy 
in the class of all probability distributions specified by the constraints stated therein. 



Proposition 2.3 The pdf /(■) of a random variable X, given by ^j, is the unique solution 
of the optimization problem 

f = arg max M s (h), 
under the constraints CI and C2 presented in Proposition \2.1A 
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2.1 A as a concentration parameter 



We provide two asymptotic results of this class, by making the parameter A tend to ±00. 
This results will allow us to give an interpretation for this parameter. Since F\(x) — > x as 
A — > 0, we have, trivially, that if ~ exp — G and X G ~ G, then 

X? -A X G , 

as A — > 0, where — > stands for convergence in distribution. 

Therefore, the definition of the family exp-G by using fl2]) with A e M is good. This fact 
plays an important role in our paper because this makes the family exp-G contain G as par- 
ticular case. The following result is very important since regular distributions in Statistics 
enjoy many desirable properties. 



Proposition 2.4 If G is a parametric regular probability distribution, with parametric space 
Q, then so is the exp-G distribution, with respect to the parametric space Rx 6. 

Proof. 

The proof follows from a simple verification of the conditions given in Lehmann and Casella 
(2003). 

□ 

The distribution may present very different behaviour for large absolute values of A, thus 
showing that this is a rich class of distributions. 

Going further on the discussion of what happens when the absolute value of A is large. 
We begin by noting that F G will tend to one if A tends to infinity, whenever x is such that 
G(x) > 0, and will be zero otherwise. Therefore, if X\ follows a exp-G distribution, where 
G is any cdf, then 

X\ — >■ s a , 

as A 00, where a = inf{x;G(x) > 0}, ' — - stands for vague convergence, and 5 a is the 
Dirac's measure concentrated on a, that is, 5 a ({a}) = 1. Note that we needed to consider 
the vague convergence instead of convergence in distribution to allow a = —00. If a = —00, 
then 

where 1 is the function identically equal to one, which is not a probability measure. However, 
we may interpret this "probability measure" concentrated at —00, that is, if a 

random variable would follow 1, then pr(X < x) — 1 for all 16R. 

We now obtain the asymptotic behaviour of A — > —00. For this simple calculus 

argument allows us to conclude that F G will tend to zero, whenever x is such that G{x) < 1, 
and will be 1 otherwise. Therefore, if X\ follows a exp-G distribution, where G is any cdf, 
then 

X\ — — >■ 8 b , 
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as A — > — oo, where b = sup{x; G(x) < 1}. Note that we also needed to use the vague 
convergence to include the case where b = oo. In this case 



v 



where is the function identically equal to zero, which, again, is not a probability measure. 
However, we may, accordingly, interpret this "probability measure" concentrated 

at oo, that is, if a random variable would follow 0, then pr(X < x) = for all igR. 

We see from this result, that the parameter A can be interpreted as a concentration 
parameter, because it moves the exp-G distribution to a degenerated distribution in a (if a 
is finite), when it varies from zero to infinity, and to a degenerated distribution in b (if b is 
finite) when it varies from to minus infinity. Furthermore, if a equals minus infinity, the 
distribution moves towards the left side of the axis until the mass escape entirely, when A 
tends to infinity. Analogously, when b equals infinity, the distribution moves towards the 
right side of the axis until the mass escape entirely, when A tends to minus infinity. 

2.2 Reciprocal property 

This family of distributions enjoys a very interesting reciprocal property. We begin by 
introducing some notation, let X G ~ G, and 1/X G ~ S, where G is continuous. Therefore, 
we have that if X G ~ exp— G, then l/X G ~ exp— S. To see this, observe that, for A ^ 0, 



We also would like to remark that the reciprocal of X G has a corresponding exp-S distribution 
with -A, that is, has cdf F^(x) and has cdf F^ x (x). 

This means that whenever we study a special case of the exp-G distribution, we may 
easily study the reciprocal case. For instance, in this paper we study the exp-Weibull, and 
from this result, we also obtain several properties of the exp-Frechet distribution. 

2.3 Expansions, order statistics and moments 

We now give an useful expansions for the pdf ([3]). With this expansion, we can obtain math- 
ematical properties such as ordinary moments, factorial moments and moment generating 
function of the exp-G distribution from G distribution. Expanding the term e~ XG( - x ' 9 ^ in ()3]), 
it follows 



pr(l/X 



<x)= pr(Xf > 1/x) 



1 -exp{A(l -G(l/x))} 
1 — exp{A} 



F*_ x (S(l/x)). 




j=0 



(9) 



If G(-;9) has not closed-form, suppose 



oo 




(10) 



k=0 
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where {ctfcjfcLo * s a sequence of real numbers and cel. Several distributions do not have 
closed- form cdf and can be written in the form (ITU}) , we have, for instance, the normal, 
gamma and beta distributions. 

For n positive integer, we have 

' oo \ n oo 

^a k x k \ =^2c n , k x h , (11) 

vfc=0 / k=0 

where c nj0 = and c n , m = (ma ) _1 Y2=ii. nk ~ m + k)a k c n . m - k for m = 1,2,... (see 
Gradshteyn and Ryzhik, 2000). Using (flQj) and (TTTj) in fl9]), it becomes an useful expansion 
for ([3]) when G(-; 0) has not closed-form given by 

m = y—- X 9(x; o) { —r^ k+jc - (12) 

i,fc=o 

Let now Xi, . . . , X n be a random sample with pdf in the form ([3]) and define X, :n the ith 
order statistic. The pdf of the X i:n , say fi- n , is given by 

/*.•»(*) = m - 1 r- -r f(x)F(x) i - 1 {l-F(x)r- i 
B(t, n — i + 1) 

A#(s; 0)e AG(x ' e) ^ _ e _AG(x;e)|i-i| e -AG(x ; e) _ e -Aj.n-i ^ 



B(i,n- i + 1)(1 - e- A ) n 



By using binomial expansion for the terms {1 — e ^(^i 6 ')} 1 1 and {e XG ( x > e ) — e A }™ 1 in 
(PP . it follows 



n l \ e -\(n-k-i) 



v ' y j=0 fc=0 J v J 

x(l-e-^ +fc+1 ))/,, fc (x), (14) 

where fj,k{') denotes the pdf of a random variable with exp-G(A(j + k + 1), 0) distribution. 
Therefore, the pdf of Xj :n can be written as linear combination of pdf 's in the form ([3]) and, 
hence, the mathematical properties of the order statistics can be obtained from associated 
exp-G distribution. 

We hardly need to emphasize the necessity and importance of moments in any statistical 
analysis especially in applied work. Some of the most important features and characteristics 
of a distribution can be studied through moments, e.g., tendency, dispersion, skewness and 
kurtosis. We now give general expressions for the moments of the family exp-G of distribu- 
tions. 
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Consider X and Y be random variables with exp-G(A, 9) and G distributions, respec- 
tively. When G(., 9) has closed-from, an useful expression for the rth moment of the exp-G 
distributions it follows from and it is given in function of the probability weighted mo- 
ments of the Y: 



E(X r ) = -^-^ f; U¥- E {Y*G(Y-, &Y}. (15) 

3=0 J ' 

In particular, formula (Tl5|) provides us another proof of condition v) in Proposition 12.11 
If G(-,9) has not closed-from, from (Fl2|) we obtain the rth moment of X in function of 
the moments of Y: 

E(X r ) = T -^ ZI { -^ L c j , k E(Y r+k+ n. (16) 

In particular, if c is integer non-negative, the moments of X are given in function of the 
ordinary moments of Y . Finally, with the result ( TH|) the rth moment of the zth order 
statistic is given by 



F(X r ) S ~ e ' A) '" V V (-1) W+J -^ fi-l\ fn-i 

1 l:n> " B(i,n-i + l)^ o ^ o j + k + 1 



e -\{n-k-i) 



X 



:±-e^ +k+1) )E(Zl k ), (17) 



where Zj^ has exp-G(A(j + k + 1), 9) distribution. The expansions ([9]), ( |T2l and are 
main results of this Section and plays an important role in this paper. 



2.4 Estimation and inference 

Let X be a random variable with exp-G(A, 9) distribution, with A ^ 0. The log-density of 
X with observed value x is given by 

£ = £(X, 9) = log A - log(l - e- x ) + log g(x; 9) - XG{x; 9) 

and the associated score function is U = (d£/dX,d£/d9) T , where 

di - 1 1 G(r-O) d£ -II*(9) A ^(^) 

with U*(9) being the associated score function of the log-density of a random variable 
with pdf g(.,9). From regularity conditions, we have E{G(X; 9)} = A -1 — (e A — 1) _1 and 
E{dG(X;6)/d6} = \- 1 E{U*(6)}. 

The information matrix K = K((X, 9) T ) is 

X — ( Kx ' x Kx ' e ] 
\ K e,x ^e,e J 

9 



where 



For a random sample re = (27,..., x„) of size n from X and = (\,9) T , the total 
log-likelihood is 

n 
i=l 

where is the log-likelihood for the ith observation (i = 1, . . . ,n) as given before. The 
total score function is U n = U n (Q) = Y^=i U^ l \ where C/W for % = 1, . . . , n has the form 
given earlier and the total information matrix is K n {6) = nK(Q). 

The maximum likelihood estimator (MLE) © of O is obtained numerically from the 
solution of the non-linear system of equations U n = 0. Under conditions that are fulfilled 
for the parameter in the interior of the parameter space but not on the boundary, the 
asymptotic distribution of 

v^(e-e) ~ iwo^er 1 ), 



A 

where '~' stands for the asymptotic distribution. The asymptotic multivariate normal 
iVfc + i(0, K n (9) _1 ) distribution of can be used to construct approximate confidence re- 
gions for some parameters and for the hazard and survival functions. In fact, an 100(1 — 7)% 
asymptotic confidence interval for each parameter 0j is given by 



ACh = (e< - * 7/2 VK e " e s + z l/2 Vk @ ^), 

where k 0i ' ei denotes the ith diagonal element of i^ n (0) _1 for i = 1, . . . , k + 1 and 27/2 is the 
quantile 1 —7/2 of the standard normal distribution. The asymptotic normality is also useful 
for testing goodness of fit of the exp-G distribution and for comparing this distribution with 
some of its special submodels using one of the three well-known asymptotically equivalent test 
statistics - namely, the likelihood ratio (LR) statistic, Rao (Sr) and Wald (W) statistics. 
Consider the partition = (Of, Of ) T of the vector of parameters for the exp-Weibull 
distribution. The total score function U n = (Uf, U^) 7 " and the total Fisher information 
matrix and its inverse 

'11 Z^12 



21 K22 



are assumed partitioned in the same way as 0. The LR statistic for testing the null hypothesis 
H : ©! = Oj ^ versus the alternative hypothesis Ha '■ ©i 7^ ©^ is given by w = 2{£(<d) — 
£(Q)}, where and denote the MLEs under the null and the alternative hypotheses, 
respectively. The statistic w is asymptotically (as n — > 00) distributed as x 2 , where q is the 



dimension of the vector 6\ of interest. The score statistic for testing H is Sr = U^K u Ui, 
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where U\ and K 11 are the components of U n and K^ 1 corresponding to Q 1 evaluated at 
0. The score statistic Sr has asymptotically the \\ distribution and has an advantage over 
the LR since it only needs the estimation under the null hypothesis but requires the inverse 
Fisher information matrix. The Wald statistic for testing the null hypothesis Hq : ©i = ©^ 
is given by W = (@i — ©f ■ > ) T 7^ 11 ^ 1 ^(0i — ©f^), where K u is the component of the inverse 
information matrix K~ l corresponding to ©i evaluated at 0. The Wald statistic W has 
also under H an asymptotic x 2 q distribution. The Wald and score statistics are very used in 
practice and our derivation of the information matrix will be very convenient in modelling 
the exp-G distributions. 



2.5 Modified profile likelihood estimator 

Since A is a parameter added to some distribution, it can be seen as a nuisance parame- 
ter. With this in mind, we will advance a modified profile estimator for 9. From the last 
subsection, we have that 

di -U*(9) \ dG{x]9) 
d9~ U {9) - X ^9—> 

with 

„ fde\ 

B U) =0 - 

Therefore, if 8G(x;9)/89 T dG(x;9)/d9, that belongs to R, does not vanish for all values in 
some open neighbourhood of the true value of 9, let 

- f dG(x;9)dG(x;9) \- 1 dGjx^ 
X= \-dF d9~ I -d9^ U{dl 



with 



dX f 8G(x; 9) 8G(x; g) 1 f d 2 G(x; 9) dG(x; 9) \ dG{x; 9) 

89 { d9 T 89 J \ d9 T d9 T 09 J 89 T ^' 

f dG(x;6) dGjx^) }- 1 f dH^)) 8G(x; 9) dU*(9) 



d9 T 89 J [ 89 T 89 T w d9 T 89 

where 8 2 G(x;9)/89 T 89 T stands for the row vector containing the diagonal elements of 
the Hessian matrix of G, 8 2 G(x;9)/89d9 T , and OU*{9)/d9 stands for the column vector 
(dU*(9)/d9!,...,dU* k (9)/d9 k ) T . ' 

We, therefore, obtain the modified profile likelihood function: 

I = 1(9) = log A - log(l - e~ x ) + \ogg(x; 9) - XG(x; 9). 

The modified profile estimator for 9 can be obtained by maximizing £. Let V be the 
estimating equation given by 

V{9) - 89 - 1 89 ~ ^189 +U W- 89 G{X > ° ] " X ^9~> 
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one may also obtain the profile likelihood estimator by solving the equation V n {6) = J^™ =1 V^'(9) = 
0. 

2.6 Interest case A = 

We now discuss estimation and inference when A = 0. It is very important to discuss this 
case because we are interested in testing the hypotheses H : A = versus Hi : A ^ 0, i.e., 
to test if the exp-G fit is significantly better than G fit. The next result plays a important 
role in this paper. 

Theorem 2.5 Let F\(-) and f\(-) be the cdf and pdf defined by and ^j), respectively. 
The following conditions are true: 

i) If G is continuous then Fx — > G uniformly when A — > 0; 

ii) fx 9 uniformly when A — > 0, consequently £ n (X,8) — > t n {6), where t n (Q) is the 
log-likelihood associated to G; 

Hi) d£/d\ ->■ 1/2 - G(x; 6) and dl/BO ->■ U*(0), when A -> 0; 

iv) ka,a -»■ 1/12, K 6>e J dU*(9)/d9d9 T dG and n Xfi -(1/2, 1/2, 1/2) T ; when A -»■ 0, 
wift J" dU* {6) / dOdO 1 dG being the information matrix with respect to G; 

v) If G is regular and (Xq,9q) £ ©, t/jen 

v^{(A,r) T - (A ,O T } A ^^^(Ao,^)- 1 ); 

// G regular and (X ,9 ) G ©, i/ien i/te likelihood ratio, Wald and Score statistics 
has null asymptotic distribution \ 2 q , where q is the number of parameters estimated in 
alternative hypothesis minus the number of parameters estimated in null hypothesis. 

3 The exp-Weibull distribution 

We now move to the class of distributions exp-G, when G is the cdf of the Weibull dis- 
tribution, we will call this class of distributions by exp-Weibull. More precisely, to ob- 
tain the exp-Weibull distribution we put in (J2J) the cdf of the Weibull distribution G(x) = 
1 — exp{ — (x/j3) a }, where /3 > 0, a > and x > 0. Therefore, the cdf of the exp-Weibull 
distribution given by 

fW ^- exp{ -y^ )} . *>o. 
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From the general expressions ([3]) and (JSj) we obtain that the pdf and hazard functions 
are given by 



f{x) = - _ A ar°" exp{-A(l - e^ /w ) - (x//3) a }, x > 



and 



^ )= l-exp{-Ae-(^)°} ' X> °' 



respectively. 



We now illustrate the flexibility of this class of distributions by presenting some graphics 
of both the pdf and hazard functions. Figure [T] shows the plots of the pdf of the exp- 
Weibull distribution for some values of a and (3, and for A = — 5, — 1, 0, 1, 5, oo. We note 
that when the the value of A increases the pdf becomes more 'peaked'. Figure [2] contains the 
plots of hazard function of the exp-Weibull distribution for different values of a and and 
A = —5,—l,0,l,5,oo. We note that the behaviour of the hazard function of the Weibull 
distribution is close to the behaviour of the graphics with A = 1.0, and as the value of A 
increases, the behaviour of the hazard function of the exp-Weibull becomes very different 
from the behaviour of the hazard function of the Weibull distribution, showing that as the 
value of A gets larger the exp-Weibull "moves away" from the Weibull distribution, and gets 
closer to the Dirac mass at zero, as remarked on the end of the last Section. 



3.1 Order statistics and moments 

The pdf of the ith order statistic of a random sample from exp-W (A, (3, a) distribution is 
given by 

f . v Aa/3- a s a - 1 exp {-A(l - e~^ a ) - (s/fl)*} r _ A(1 _e-(./««M 
U - n[X) ~ B(i,n-i + l)(l-e-*)» I J X 

We will now obtain series representation for the moments of the exp-Weibull distribution 
and of the order statistics. To this end, let X be a random variable following a exp-Weibull 
distribution with parameters /3 > 0, a > and A > 0. From now on we will use the notation 
X ~ exp-Weibull (A, (3, a) to indicate this fact. 

We have the probability weighted moment of a random variable Y following Weibull distri- 
bution with parameter vector 9 = (/?, a) T can be written as E{Y r G(Y; 9Y} = (3 r J Q (— \ogu) r ^ a (l — 
u^du. Therefore, from ffl5|) it follows that the rth moment of X is 

W-r^E^" !\i-u)\-\o gu y/«du. (19) 
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Exp-Weibull(X,0.5,0.8) 



Exp-Weibull(A.,1.5,2) 



x = ™ 

X = 5 
X= 1 
X = 
X = -1 
X = -5 



o \ = °o 

--- X = 5 
X=1 

X = 

■-- X = -1 
X = -5 




Exp-Weibull(X,2,5) 




0.2 0.4 0.6 0.8 



Figure 1: Graphics of the pdf of the exp-Weibull distribution for some values of the param- 
eters. 

We now give an alternative expression to ( f!9|) more simple. The rth moment of X is 



E(X r ) = / x r f{x)dx 



Xa(3~ a 



-x 



r+ot— 1 



'0 JO 1 — e 

Now, expading expjAe^/^"} in Taylor's series we get 



exp{— A(l — e 



\„-A 00 \fc /-oo \„-A 00 

^ k\ L 1 - e- x 



1 - e 



—A 



fc=0 



A*£(n r ) 



where follows the Weibull distribution with parameters (k + l) 1 /"/ '(3 and a, and the 
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Exp-Weibull(X,0.5,0.8) 



Exp-Weibull(A.,1.5,2) 




Exp-Weibull(X.,1,10) 




x 



Figure 2: Graphics of the hazard function of the exp-Weibull distribution for some values of 
the parameters. 



interchange between the series and integral being possible due to Fubini's theorem together 
with the fact that we are dealing with positive integrand. Hence, we have that the rth 
moment of a exp-Weibull distribution can be written as 

Figure [3] shows skewness and kurtosis of the exp-Weibull distribution, obtained from 
application of the formula of the moments above, for /3 = 0.5 and some values of a as 
function of A. We now note from ( 120]) that all moments of the exp-Weibull distribution 
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tends to zero as A increases to infinity, which is a very remarkable fact. So, as we can note 
from Figure [31 as A increases, the skewness tends to zero, as well as the kurtosis, one more 
time reflecting the expected behaviour of the limiting distribution as A — > oo. 




Figure 3: Skewness and kurtosis of the exp-Weibull distribution for some values of the 
parameters. 

An expression for the rth moment of the ith order statistic of the exp-Weibull distribution, 
say X i:n , follows from ( ITT)) and ( 120]) : 



JK ' 1=0 j=0 k=0 ' ■ ' ' 



Mn+i-i+p W + k + l)} 1 , s 

l\(l + l)r/a+l ■ 

Expressions f|T9|) and fl2T|) show the importance of the expansions given in Subsection (12. 3p . 
Furthermore, result fl20|) shows that alternative expressions to f|T5l) and f|T6|) can be obtained 
depending of the G distribution. 

3.2 Order statistics and moments of the exp-Frechet distribution 

In this brief subsection we use the reciprocal property of the exp-G distributions to obtain 
expressions for the moments and order statistics of the exp-Frechet distribution. 

Let Y ~ exp-Fr(A, (3, a) and Yi- n be the ith order statistics from a random sample, of size 
n, of the exp-Frechet distribution. From formulae (120]) and (12T1) . we have that the moments 
of Y and Yi- n are 

F(Yn- A r(l-r/a) f. (~A) fc 
1 ; (3 r (e- x -l) f^k\(k + iy- r / a 
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and 



E{Xln) 



-\T(l-r/a) 



oo i — 1 n—i 



/3 r B(i,n-i + l)(l -e 



EEB-D 

1=0 j=0 k=0 



n+j—k—i 



i — 1 \ (n — i 



x 



u n+ , 7 - m) {-AQ- + fe + l)}' 

Z!(Z + l)l-r/a ' 

respectively, for r < a. 

3.3 Score function and information matrix 

Let 6 = (A, (3, a) T be the parameter vector and X random variable with exp-Weibull(A, (3, a) 
distribution. The log-density £ = 1(6) for the random variable X with observed value x is 
given by 



£ = —a log f3 + log(aA) 



A{l-e Vp) } -log(l -e~ A ) + (a- l)logx, x > 0. 



The score function is given by 



dl -(n\ a 1 

— = — 1 + e H 

d\ 1 — e A A 



1 

+ v 



dl_ 

da 



-i + i |) {i+Ae-(?r 



} 



i + log(s) - log(/3) - (|)° log {l + Ae"^)"} 



From the regularity conditions one obtains the following closed-form expressions 



i - 



i 



l-e> 



1 

A' 



E 



1 + A e 



"(f) c 



and 



log(f) {l + Ae-W)"} 



a 



-log(/9) + £{log(X)}. 



For interval estimation and hypothesis tests on the model parameters, we require the infor- 
mation matrix. We will, therefore, use some of the expressions above to obtain the Fisher's 
information matrix. The 3x3 unit information matrix K = K((X, (3, a) T ) is 



«A,A «A,/3 K \a 
/V = \ K\fi Kp :/3 KfS^a 

^\,a K>P,a ^a,a 
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whose elements are 

1 



and 



A 



«/3,a = « 



E 



K/3,/3 



(-1 + e 
log 



/3A 



J 



-- + log(/3)-£{log(X)} 



l-a\E<e~W) [j 



2a 



X 



+ E 



X\ a 

J 



ft 



log 



2 a 



-77 ) log -r 



/3 



- + log(/3)-£{log(X)} 

a 



1 - Ae~V7. 



These elements of the information matrix depend on some expectations that can be easily 
obtained through numerical integration. 



4 The exp-beta distribution 

Let Y be a random variable following standard beta distribution with parameters a > and 
b > 0. The cdf of Y is given by G(ac; (a,6) T ) = 4(a,6), where J x (a,6) = B^b)- 1 ^t a ' x {l- 
t) b ~ 1 dt denotes the incomplete beta function and B(a,b) = Lt a ~ x (l — t) fe_1 <it is the beta 
function. The exp-beta distribution is introduced by taking G as being the cdf of Y in (J2]). 
We will denote a random variable X with exp-beta distribution by X ~ exp-beta(A, a, 6). 

The pdf and cdf of the exp-beta distribution are given by 

/ W = B(a, t )(l- e ->) ^ 1 -^" e ^'-"- I£(0 ' 1) 

and 

F(x) = — 3—, i£(0,l), 

1 — e _A 

respectively. 

Figure H] shows the plots of the pdf of the exp-beta distribution for some values of a and b, 
and for A = — oo, —10, —3, 0, 3, 10, oo. Observe that for the exp-beta(A,2,l) distribution, the 
density of the beta(2,l) distribution is very close to a straight line, whereas the densities of 
the exp-beta(A,2,l) distributions may assume various shapes, such as unimodal, and strictly 
increasing. 
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Exp-beta(X,4,2) 



Exp-beta(A.,2,1) 




Exp-beta(A.,15,15) 




x 



Figure 4: Graphics of the pdf of the exp-beta distribution for some values of the parameters. 
The cdf of Y can be written in the form ( TTOj) . To see this, use the expansion 

^ x > Z^Y(b-k)k\ ' 

for |s| < 1 and b > real non-integer. If b > is integer, the index k in the above sum stops 
at 6—1. Hence, for beta distribution we obtain = (— l) k T(a + b)/ {T(a)T(b — fc)fc!(a + A;)} -1 
for b > real non-integer and c = a in (flO]) : if 6 > is integer, = for > 6—1. Therefore, 
the pdf of the exp-beta distribution can be expanded in the form ( 1121) . 
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4.1 Order statistics and moments 

Let Xi, . . . ,X n be a random sample from exp-beta(A, a, b) distribution and denote the ith 
order statistic as The pdf of X i:n , say fi, n , is obtained from ([TBI : 

\/i _ -\\-n „a-l/i _ „\b-l p -XIx(a,b) 

fm(x) = 1 [ , £ {l-e-^-^y-^e-^-C^-e-H"-*, 

1/1 ; S(i,n-i + l)S(o,6) 1 11 1 

for x e (0, 1). 

We now give an expression for the rth moment of the exp-beta distribution. Consider 
X ~ exp-beta(A, a, b). We seen that pdf of X can be written in the form (fT2|) . We have 
that E{Y V ) = B(v + a, b)/B(a, b), for v > 0. With this results, from (03J and (TIT]) it follows 
easily that the moments of X and Xj :n are given by 

A _(-A)J 
j,fc=0 

and 



*W - (1 . e -^(a,6) £ ^^(r + * + ic + a, 5) 



A(l - e->)-» f ^ (-i^W ^ - 1 yn - z 



B(a,b)B(i,n-i + l) ^ ^ ^ U \ j J\ k 

* l,m=0 j=0 k=0 



g-A(n-fe-i) 



x{-A(j + fc + l)} l q jTO S(r + m + Zc + a, 6), 

respectively. More once, we used the expansions of the Subsection ( 12. 3D and see its impor- 
tance. 

Figure [5] shows skewness and kurtosis of the exp-beta distribution, obtained from appli- 
cation of the formula of the moments above, for a = 2 and some values of b as function of 
A. 



4.2 Score function and information matrix 

Let X random variable with exp-beta(A, a, b) distribution and 9 = (A, /3, a) T be the param- 
eter vector, with A ^ 0. The log-density I = £(8) for the random variable X with observed 
value x is given by 

i = log A - log B(a, b) - log(l - e~ A ) + (a - 1) logx + (b - 1) log(l - x) - \I x {a, 6), 
for x e (0, 1). 



The score function is given by 

dl 1 1 



d\ A e A - 1 



Ix(a,b), 



91 t / n T / .\ x dl x (a, b) 

— = -fr a + tt(a + 6) + logs - A ^ % 

^ -r /, X -r / , X , /, X , dI X (a, b) 

— = -tt(6) + tt(a + 6) + log(l-s)-A ; 
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Figure 5: Skewness and kurtosis of the exp-beta distribution for some values of the param- 
eters. 



where ^(y) = d\ogT(y)/dy. 

Under the usual regularity conditions, the expected value of the score function vanishes. 
Hence, we obtain 

E{I x {a,b)}= l --^- [ , 
E j = \~ l {m(a + b) - *(a) + E(\ogX)} 



and 



E | ^M) | = A _!^ (a + h) _ ^ {b) + E{log{1 _ X)}L 

The Fisher's information matrix K = K((\, a, b) T ) is 



/tA,a K X,b 

K = I K,\ a K, a:a K a fi 
K X,b K a,b K-b,b 



whose elements are 



K ^ = X ~ 2 ~ T~, = + b) - *(a) + E(\ogX)}, 

(-1 + e A ) 

Kx , b = \^(a + b) - *(6) + £{log(l - X)}], 
Ka , a = *'(a) - + 6) + AS {^f^} - 

21 



« a , 6 = * / (6)-* / (a + 6) + AE 

and 

Kb,b 



d 2 I x (a,b) 



-V'(a + b) + A£ j 



db 2 

d 2 I x (a,b) 



dadb 

These elements of the information matrix depend on some expectations that can be easily 
obtained through numerical integration. 



5 Application 

Our aim in this Section is to motivate the use of the class exp-G of distributions by showing 
a successful application to one real data set. We here will fit exp-Weibull distribution to the 
data set given by Birnbaum and Saunders (1969) on the fatigue life of 6061-T6 aluminium 
coupons cut parallel to the direction of rolling and oscillated at 18 cycles per second. The 
data set consists of 101 observations with maximum stress per cycle 31,000 psi. The data 
are: 70, 90, 96, 97, 99, 100, 103, 104, 104, 105, 107, 108, 108, 108, 109, 109, 112, 112, 113, 
114, 114, 114, 116, 119, 120, 120, 120, 121, 121, 123, 124, 124, 124, 124, 124, 128, 128, 129, 
129, 130, 130, 130, 131, 131, 131, 131, 131, 132, 132, 132, 133, 134, 134, 134, 134, 136, 136, 
137, 138, 138, 138, 139, 139, 141, 141, 142, 142, 142, 142, 142, 142, 144, 144, 145, 146, 148, 
148, 149, 151, 151, 152, 155, 156, 157, 157, 157, 157, 158, 159, 162, 163, 163, 164, 166, 166, 
168, 170, 174, 201, 212. 



The MLEs and the maximized log-likelihood determined by fitting the exp-Weibull and 
Weibull distributions are 

= 55.670932, A = -41.645738, a = 1.642486, £ exp - W eibuii = -454.3272 

and 

f3 = 143.3150, a = 5.9790, £ Weibu u = -459.0999, 

respectively. 

We test the null hypothesis H Q : Weibull model against the alternative hypothesis Hi : 
exp-Weibull model. The LR statistic is 9.5453 and p-value = 2 x 10 -3 . Hence, for any usual 
significance level, we reject null model (Weibull) in favour of the alternative exp-Weibull 
model. In Figure [6] are displayed the empirical density, fitted exp-Weibull and Weibull 
densities. Hence, we see that the exp-Weibull distribution yields a better fit than Weibull 
distribution. 



6 Conclusion 

We defined a family of distributions that provides a rather general and flexible framework for 
statistical analysis. It also provides a rather flexible mechanism for fitting a wide spectrum 
of real world data sets. 
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Figure 6: Empirical density and fitted exp-Weibull and Weibull densities for the Birnbaum 
and Saunders's (1969) data set. 

Several properties of this class of distributions was obtained, such as Kullback-Leibler 
divergence between G and exp-G distributions, characterization based on Shannon entropy, 
moments, order statistics, estimation of the parameters and inference. 

With this, we moved to two special distributions, the exp-Weibull and exp-beta distri- 
butions, which were studied with some details. The article were motivated by a successful 
application to fatigue life data. 

Appendix 

In this appendix we prove Propositions 12.11 and 12.31 and Theorem 12.51 

Proof of Proposition 12.11 

i) and ii) follow directly from equation ([2]); 

For in) suppose that A > 0, the calculation being analogous for A < 0. Then, 



exp{— XG(x — e)} — exp{— XG(x)} 



pG{x) 



(1 - e~ x ){G(x) - G(x - e)} 



(l-e^){G(x)-G(x-e)}J G(x _ e) 



Ae 



dx 



< 



1-e- 




x — e 



(22) 



23 



A similar computation shows that 

exp{-AG(x-e)} -exp{-AG(x)} A 

(l-e->){G(s)-G(s- e )} " T3^I ex P{- AG (-)}- 

Therefore, if x is a continuous point of G, we have that 

1 exp{— AG(x — e)} — exp{— \G(x)} A 



lim — = —r exp{ — XG(x)}. 



1 - e" A e-*g - G(x - e) 1 - e" A 

Nevertheless, if a; is a discontinuity point of G, we have that 

1 exp{— \G(x — e)} — exp{— \G(x)} exp{— XG(x— )} — exp{— \G(x)} 

Mill 



1_e ~ A e>o° G(x)-G(x-e) (1 - e- x ){G(x) - G(x-)} ' 

Suppose that G is discontinuous at the points {x 1: . . .}, and let G c be the part of G with 
continuity points (the sum of the continuous and singular parts of G), then, it is easy to 
observe that 

i7G ( , vtntw^STi exp{-AG(gj_i)} - exp{-AG(^)} 
F x (x) = F x (G c (x)) + l{ Xl < x } l- e - x ' 

i=l 

where 1a(x) is the indicator function of the set A. We now, have that 



/ - 

J (— 00,21] 



1 exp{— \G(x — e)} — exp{— XG(x)} 



exp{— \G c {x)}dG c {x) 



1 1 - e~ x 

(—oo,x\ x ° 



+ E 1 ^ 



exp{-AG(x i _i)} - exp{-XG(xi)} 



<x}~ -1 \ 

- ' 1 — e~ A 

1 — exp{— \G c (x)} 1 exp{— AG(xj_i)} — exp{— AG(xj)} 

= t^=~ x +z^ 1 {^<-} ' 

1=1 

which concludes the proof of Hi). The proof of iv) is a simple application of Hi), and to prove 
f), one uses the Hi) and the inequality in equation (1221) . for A > 0, and a similar inequality 
for A < 0. 

□ 



Proof of Proposition 12.31 

Let z(-) be a pdf which satisfies the constraints CI and C2. The Kullback-Leibler divergence 
between z and / is 



Dkl{z\\S) = 




With this, we follow Cover and Thomas (1991) and obtain 

< D KL (z\\f) = / zlogzdx- / zlogfdx 
Jr Jr 

= -Ms(z)- [ zlogfdx. 
Jr 

With the definition of / and based on the constraints CI and C2, it is easy to see that 

f zlogfdx = + +l g(-A_J\ +E{logg(G- 1 (U;6))} 

Jr e — l \i — e J 

f]o g fdx = -m s (f), 



where U is defined as before. With this, we have 

W s (z) < M s (f), 

with equality if and only if z(x) = f(x) Lebesgue - almost everywhere, thus proving the 
uniqueness. □ 

Proof of Theorem 12.51 

(i) It is well-known from real analysis that if f n is a sequence of bounded and right-continuous 
functions that converge in all point for a continuous function, then this convergence is uni- 
form. Therefore, since F\ satisfies the above conditions and converges for G, which is a 
continuous function, the proof of (i) is completed. 

(ii) -(iv) The proofs are easily checked. 

(v) For (Xq,9q) G O, with Ao 6 R \ {0}, the result follows of the Proposition 12.21 and the 
Theorem 6.5.1 from Lehmann and Casella (2003). For Ao = 0, we have to use the results in 
(ii)-(iv) to adapt the proof given in Lehmann and Casella (2003) to our case. We begin by 
showing that 

y/nXA iV(0,12), 
when n — > oo. The following lemma will be useful in this proof. 



Lemma 6.1 Let f : [a, b) — > R (b can be oo). If f admits n derivatives to right around the 
point a, then 

f{x) = f(a) + fW(a){x -a) + ... + ^—^(5 - a) n , 
where f( k > represents the kth derivative of f to right and x £ (a,x). 
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The proof the above lemma is similar to usual proof, but replacing the derivative by the 
derivative from the right. It is clear that the same result holds for derivative from the left. 

Applying the Lemma to log-likelihood with respect to pdf f(x; A) = \e~ Xx / (1 — e~ A ), i.e. 
G(x; 6) = x, if follows that 

f(A) = £'(0) + X£" (0) + ^X 2 £'"(X) 

= - - z,- - —A + -X 2 £'"(X), 
2 ' 12 2 v ' 

3=1 

where f"(A) = -|A~ 3 - d 3 (e X - l)" 1 /^. 
By supposition, i'(X) = 0, thus 

/—\ -i/2 n /2 — Z]j=l x j 
JnX = n ' — . 

12- 1 - (2n)~ l Xl'"{X) 

As n~ 1 X£"'(X) ^ in probability and n~ 1 / 2 (n/2 - YT j= i Xj) 4 iV(0, 1), when n ->■ oo. 

The rest of the proof is analogous to the one given in Lehmann, where one may use the 
results in (ii)-(iv) to ensure that all the arguments holds true. 

(vi) It follows from asymptotic normality of O, see Lehmann and Romano (2008) for more 
details. 

□ 
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